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This invention relates to newly identified 
polynucleotides, polypeptides encoded by such 
polynucleotides, the use o£ such polynucleotides and 
polypeptides, as well as the production of such 
polynucleotides and polypeptides. More particularly, the 
polypeptide of the present invention is connective tissue 
growth factor-2 sometimes hereinafter referred to as "CTGP- 
2 " . The invention also relates to inhibiting the action of 
such polypeptides. 

The CTGF polypeptides are structurally and f iinctionally 
related to a family of growth factors which include IGF 
(insulin- like growth factor) , PDGF (platelet -derived growth 
factor) , and PGF (fibroblast growth factor) . This emerging 
family of secreted proteins are a group of cysteine -rich 
proteins. This group of growth factors are iirportant for 
normal growth, differentiation, morphogenesis of the 
cartilaginous skeleton of an embryo cind cell growth. Among 
some of the functions that have been discovered for these 
growth factors are wound healing, tissue repair, itrplant 
fixation and stimulating increased bone mass- 

The extended superf amily of growth factors include TGP 
(transforming growth factor) , bone morphogenic factors, and 
activins, among others. 
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The most well-known growth factor, T6F exerts a number 
of different effects on a variety of cells- For exainple, 
TGF-|8 can inhibit the differentiation of certain cells of 
mesodermal origin (Florini, J«R. et al., J> Biol> Chem. . 
261:1659-16513 (1986) induced the differentiation of others 
{Seyedine, S.M. et al., PNAS USA . 82:2267-2271 (1987) and 
potently inhibit proliferation of various types of epithelial 
cells, (Tucker, R.P., Science . 226:705-705 (1984)). This 
last activity has led to the speculation that one ittqportant 
physiological role for TGF-jS is to maintain the repressed 
growth state of many types of cells* Accordingly, cells that 
lose the ability to respond to T6F-jS are more likely to 
exhibit uncontrolled growth and become tumorigenic. 

Accordingly, due to amino acid sequence homology the 
polypeptide of the present invention is a member of this 
extended family of growth factors which has many effects on 
a variety of different tissues. 

In accordance with one aspect of the present invention, 
there is provided a novel mature polypeptide, as well as 
biologically active and diagnostically or therapeutically 
useful fragments, analogs and derivatives thereof* The 
polypeptide of the present invention is of human origin. 

In accordance with another aspect of the present 
invention, there are provided isolated nucleic acid molecules 
encoding a polypeptide of the present invention including 
mRNAs, DKAs, cDNAs, genomic DNAs as well as analogs and 
biologically active and diagnostically or therapeutically 
useful fragments thereof. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for producing such 
polypeptide by recombinant techniques con5>rising culturing 
recombinant prokaryotic cuid/or eukaryotic host cells, 
containing a nucleic acid sequence encoding a polypeptide of 
the present invention, under conditions promoting expression 
of said protein and subsequent recovery of said protein. 
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In accordance with yet a further aspect of the present 
invention, there is provided a process for utilizing such 
polypeptide, or polynucleotide encoding such polypeptide for 
therapeutic purposes, for exan?)le, enhancing the repair of 
connective and support tissue, promoting the attachment, 
fixation and stabilization of tissue implants and enhancing 
woxmd healing. 

In accordance with yet a further aspect of the present 
invention, there are provided antibodies against such 
polypeptides . 

In accordance with yet a further aspect of the present 
invention, there is also provided nucleic acid probes 
comprising nucleic acid molecules of sufficient length to 
specifically hybridize to a nucleic acid sequence of the 
present invention. 

In accordance with yet a further aspect of the present 
invention, there are provided antibodies against such 
polypeptides . 

In accordance with another aspect of the present 
invention, there are provided agonists which mimic the 
polypeptide of the present invention and binds to the 
receptors . 

In accordance with yet another aspect of the present 
invention, there are provided antagonists to such 
polypeptides, which may be used to inhibit the action of such 
polypeptides, for example, in the treatment of CTGP dependent 
tximor growth. 

In accordance with still another aspect of the present 
invention, there are provided diagnostic assays for detecting 
diseases or susceptibility to diseases related to mutations 
in the nucleic acid secpiences encoding a polypeptide of the 
present invention. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for utilizing such 
polypeptides, or polynucleotides encoding such polypeptides. 
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for in vitro puarposes related to scientific research, for 
exatrple, synthesis of DNA cuid manufactxire of DMA vectors. 

These and other aspects of the present invention should 
be apparent to those skilled in the art from the teachings 
herein . 

The following drawings are illustrative of eraboditnents 
of the invention and are not meant to limit the scope of the 
invention as enconpassed by the claims. 

Figtire 1 depicts the cDMA sequence and corresponding 
deduced amino acid sequence of CrGP-2, The standard one- 
letter abbreviation for amino acids is used. 

Figure 2 is an amino acid comparison between CT6F-2 
(top) and Cyr61 ((bottom) illustrating the amino acid 
sequence homology. 

Figure 3 is an amino acid coitparison between CT6F-2 
(top) and mouse CTGF (bottom) illustrating the amino acid 
sequence homology. 

In accordcince with an aspect of the present invention, 
there is provided an isolated nucleic acid (polynucleotide) 
which encodes for the mature polypeptide having the deduced 
amino acid sequence of Figure 1 (SEQ ID NO: 2) or for the 
mature polypeptide encoded by the cDNA of the clone deposited 
as ATCC Deposit No. 75804. 

The polynucleotide of this invention , was discovered in 
a cDNA library derived from Human fetal lung. It is 
struct\irally related to the IGF and PDGF family. It contains 
an open reading frame encoding a protein of approximately 375 
amino acid residues of which approximately the first 24 amino 
acids residues are the putative leader sequence such that the 
putative mature protein conqprises 351 amino acids. The 
protein exhibits the highest degree of homology to Mouse CTGF 
with 49% identity and 67% similarity and to Cyr61 with 89% 
identity and 93% similarity. C^61 is a growth factor- 
inducible immediate early gene initially identified in serum- 
stimulated mouse fibroblasts. It encodes a member of an 
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emerging family of cysteine-rich secreted proteins that 
includes a connective tissue growth factor (O'Brien and Lau, 
Cell Growth Differ., 3:645-54 (1992)). 

The polynucleotide of the present invention may be in 
the fom of RNA or in the form of DNA, which DNA. includes 
cDNA, genomic DNA, and synthetic DNA. The DNA may be double- 
stranded or single -stranded, and if single stranded may be 
the coding strand or non-coding (ant i -sense) strand. The 
coding sequence which encodes the mature polypeptide may be 
identical to the coding sequence shown in Figure 1 (SEQ ID 
NO:l) or that of the deposited clone or may be a different 
coding sequence which coding sequence, as a result of the 
redundancy or degeneracy of the genetic code, encodes the 
same mature polypeptide as the DNA of Figure 1 (SEQ ID N0:1) 
or the deposited cDNA. 

The polynucleotide which encodes for the mature 
polypeptide of Figure 1 (SEQ ID NO: 2) or for the mature 
polypeptide encoded by the deposited cDNA may include: only 
the coding sequence for the mature polypeptide; the coding 
secpxence for the mature polypeptide cmd additional coding 
sequence such as a leader or secretory sequence or a 
proprotein sequence; the coding sequence for the mature 
polypeptide (and optionally additional coding sequence) and 
non- coding sequence, such as introns or npn- coding sequence 
5' and/ or 3' of the coding sequence for the mature 
polypeptide . 

Thus, the term "polynucleotide encoding a polypeptide" 
encompasses a polynucleotide which includes only coding 
sequence for the polypeptide as well as a polynucleotide 
which includes additional coding and/or non- coding sequence. 

The present invention further relates to variants of the 
hereinabove described polynucleotides which encode for 
fragments, analogs and derivatives of the polypeptide having 
the deduced amino acid sequence of Figure 1 (SEQ ID NO: 2) or 
the polypeptide encoded by the cDNA of the deposited clone. 
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The variant of the polynucleotide may be a naturally 
occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. 

Thus, the present invention includes polynucleotides 
encoding the same mature polypeptide as shown in Figure 1 
(SEQ ID NO: 2) or the same mature polypeptide encoded by the 
cDNA of the deposited clone as well as variants of such 
polynucleotides which variants encode for a fragment, 
derivative or analog of the polypeptide of Figure l (SEQ ID 
NO: 2) or the polypeptide encoded by the cDNA of the deposited 
clone. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

As hereinabove indicated, the polynucleotide may have a 
coding sequence which is a naturally occurring allelic 
variant of the coding sequence shown in Figure 1 (SEQ ID 
NO:l) or of the coding sequence of the deposited clone* As 
known in the art, an allelic variant is an alternate form of 
a polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which does 
not sxibstantially alter the fxinction of the encoded 
polypeptide . 

The polynucleotides of the present invention may also 
have the coding sequence fused in frame to a marker sequence 
which allows for purification of the polypeptide of the 
present invention. The marker sequence may be a hexa- 
histidine . tag supplied by a pQE-9 vector to provide for 
purification of the mature polypeptide fused to the marker in 
the case of a bacterial host, or, for example, the marker 
sequence may be a hemagglutinin (HA) tag when a mammalian 
host, e.g. COS -7 cells, is used. The HA tag corresponds to 
an epitope derived from the influenza hemagglutinin protein 

{Wilson, I., et al.. Cell, 37:767 (1984)). 

The term "gene" means the segment of DMA involved in 
producing a polypeptide chain,- it includes regions preceding 
cind following the coding region (leader and trailer) as well 



as intervening sequences (introns) between individual coding 
segments (exons) * 

Fragments of the full length gene of the present 
invention may be used as a hybridization probe for a cDNA 
library to isolate the full length cDNA and to isolate other 
cDKAs which have a high sequence similarity to the gene or 
similar biological activity. Probes of this type preferably 
have at least 30 bases and may contain, for exanqple, 50 or 
more bases. The probe may also be used to identify a cDNA 
clone corresponding to a full length transcript and a genoxoic 
clone or clones that contain the complete gene including 
regulatory and promoter regions, exons, and introns. An 
exattple of a screen cottqprises isolating the coding region of 
the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a 
sequence cott5>lementary to that of the gene of the present 
invention are used to screen a library of hiiman cDNA, genomic 
DNA or mRNA to determine which members of the library the 
probe hybridizes to. 

The present invention further relates to 
polynucleotides which hybridize to the hereinabove -described 
sequences if there is at least 90%, and more preferably at 
least 95% identity between the sequences. The present 
invention particularly relates to polynucleotides which 
hybridize under stringent conditions to the hereinabove- 
described. polynucleotides. As herein used, the term 
"stringent conditions" means hybridization will occur only if 
there is at least 95% and preferably at least 97% identity 
between the sequences. The polynucleotides which hybridize 
to the hereinabove described polynucleotides in a preferred 
embodiment encode polypeptides which either retain 
substantially the same biological function or activity as the 
mature polypeptide encoded by the cDNAs of Figure 1 (SEQ ID 
N0:1) or the deposited cDNA(s} . 



Alternatively, the polynucleotide may have at least 20 
bases, preferably 30 bases, and more preferably at least 50 
bases which hybridize to a polynucleotide of the present 
invention and which has an identity thereto, as hereinabove 
described, and which may or may not retain activity. For 
example, such polynucleotides may be employed as probes for 
the polynucleotide of SEQ ID N0:1, for exairple, for recovery 
of the polynucleotide or as a diagnostic probe or as a PGR 
primer , 

Thus, the present invention is directed to 
polynucleotides having at least a 90% and more preferably at 
least a 95% identity to a polynucleotide which encodes the 
polypeptide of SEQ ID NO: 2 as well as fragments thereof, 
which fragments have at least 30 bases and preferably at 
least 50 bases and to polypeptides encoded by such 
polynucleotides . 

The deposit (s) referred to herein will be maintained 
under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as 
convenience to those of skill in the art and are not an 
admission that a deposit is required tmder 35 U.S.C. §112* 
The sequence of the polynucleotides contained in the 
deposited materials, as well as the amino, acid sequence of 
the polypeptides encoded thereby, are incorporated herein by 
reference, and are controlling in the event of any conflict 
with any description of sequences herein. A license may be 
required to make, use or sell the deposited materials, and 
no such license is hereby granted. 

The present invention further relates to a polypeptide 
which has the deduced amino acid sequence of Figure 1 (SEQ ID 
NO: 2) or which has the amino acid sequence encoded by the 
deposited cDNA, as well as fragments, analogs and derivatives 
of such polypeptide. 
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The terms »»fragment, " "derivative" and "analog" when 
referring to the polypeptide of Pigiire 1 {SEQ ID NO: 2) or 
that encoded by the deposited cBSk, means a polypeptide which 
retains essentially the same biological function or activity 
as such polypeptide. Thus, an analog includes a proprotein 
which can be activated by cleavage of the proprotein portion 
to produce an active mature polypeptide. 

The polypeptide of the present invention may be a 
recombinant polypeptide, a natural polypeptide or a synthetic 
polypeptide, preferably a recombinant polypeptide. 

The fragment, derivative or analog of the polypeptide 
af Figure 1 (SEQ ID NO: 2) or that encoded by the deposited 
cDNA may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non- conserved 
amino acid residue (preferably a conserved amino acid 
residue) and such substituted amino acid residue may or may 
not be one encoded by the genetic code, or (ii) one in which 
one or more of the amino acid residues includes a substituent 
group, or (iii) one in which the mature polypeptide is fused 
with another coirpound, such as a confound to increase the 
half -life of the polypeptide (for example, polyethylene 
glycol) , or (iv) one in which the additional amino acids are 
fused to the mature polypeptide, such as a leader or 
secretory sequence or a sequence which is eTt5>loyed for 
purification of the mature polypeptide or a proprotein 
sequence. - Such fragments, derivatives and analogs are deemed 
to be within the scope of those skilled in the art from the 
teachings herein. 

* 

The polypeptides and polynucleotides of the present 
invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 

The term "isolated" mecins that the material is removed 
from its original environment (e.g., the natural environment 
if it is natTirally occurring) . For example, a naturally- 
occurring polynucleotide or polypeptide present in a living 
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animal is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the coexisting 
materials in the natiiral system, is isolated* Such 
polynucleotides could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a 
composition, and still be isolated in that such vector or 
composition is not part of its natural environment. 

The polypeptides of the present invention include the 
polypeptide of SEQ ID NO: 2 (in particular the mature 
polypeptide) as well as polypeptides which have at least 70% 
similarity {preferably at least 70% identity) t:o the 
polypeptide of SEQ ID NO: 2 and more preferably at least 90% 
similarity {more preferably at least 90% identity) to the 
polypeptide of SEQ ID NO: 2 and still more preferably at least 
95% similarity (still more preferably at least 95% identity) 
to the polypeptide of SEQ ID NO: 2 and also include portions 
of such polypeptides with such portion of the polypeptide 
generally containing at least 30 amino acids and more 
preferably at least 50 amino acids • 

As known in the art "similarity" between two 
polypeptides is determined by comparing the amino acid 
sequence and its conserved amino acid substitutes of one 
polypeptide to the sequence of a second polypeptide. 

Fragments or portions of the polypeptides of the present 
invention may be en5)loyed for producing the corresponding 
full-length polypeptide by peptide synthesis; therefore, the 
fragments may be employed as intermediates for producing the 
full-length polypeptides. Fragments or portions of the 
polynucleotides of the present invention may be used to 
synthesize full-length polynucleotides of the present 
invention . 

The present invention also relates to vectors which 
include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the 
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invention and the production of polypeptides of the invention 
by recombinant techniques- 

Host cells are genetically engineered (transduced or 
transformed or transfected) with the vectors of this 
invention which may be, for exaTt5)le, a cloning vector or an 
expression vector. The vector may be, for example, in the 
form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating 
promoters , selecting transf ormants or amplifying the genes of 
the present invention. The culture conditions ^ such as 
tett5>eratur e , pH and the like, are those previously used with 
the host cell selected for expression, and will be apparent 
to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be 
ert5)loyed for producing polypeptides by recombinant 
techniques. Thus, for example, the polynucleotide may be 
included in any one of a variety of expression vectors for 
expressing a polypeptide. Such vectors dlnclude chromosomal, 
nonchromosomal and synthetic DNA sequences , e.g., derivatives 
of SV40; bacterial plasmids; phage DNA; baculovirus; yeast 
plasmids; vectors derived from combinations of plasmids and 
phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox 
virus, and pseudorabies . However, cuiy other vector may be 
used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the 
vector by a variety of procedures. In general, the DNA 
sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of 
those skilled in the art. 

The DNA sequence in the expression vector is operatively 
linked to an appropriate expression control sequence (s) 
(promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or 
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SV40 promoter, the E. coli> lac or trp . the phage lambda Pl 
promoter and other promoters known to control e3q)ression of 
genes in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site 
for translation initiation and a transcription terminator. 
The vector may also include appropriate sequences for 
amplifying expression. 

In addition, the expression vectors preferably contain 
one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed host cells such as 
dihydrof olate reductase or neomycin resistance for exikaryotic 
cell culture, or such as tetracycline or ampicillin 
resistance in E . coli > 

The vector containing the appropriate DNA sequence as 
hereinabove described, as well as an appropriate promoter or 
control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. 

As representative examples of appropriate hosts, there 
may be mentioned: bacterial cells, such as E. coli , 
Streptomvces , Salmonella tvphimurium ; fungal Cells, such as 
yeast; insect cells such as Drosophila S2 and Spodoptera Sf 9 : 
animal cells such as CHO , COS or Bowes melanoma ; 
adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those 
skilled in the art from the teachings herein. 

More -particularly, the present invention also includes 
recombinant constructs comprising one or more of the 
sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a 
forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further conprises regulatory 
sequences, including, for example, a promoter, operably 
linked to the sequence. Large numbers of suitable vectors 
and promoters are known to those of skill in the art, and are 
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commercially available. The following vectors are provided 
by way of example; Bacterial: pQE70, pQESO, pQE-9 (Qiagen) , 
pBS, pDlO, phagescript, psiX174, pbluescript SK, pbsks, 
pNHSA, pNHlSa, pNHlSA, pNH46A (Stratagene) ; ptrc99a, pKK223- 
3, pKK233-3, pDR540, pRIT5 (Pharmacia); Eiakaryotic: pWLNBO, 
PSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, 
pSVL (Pharmacia) . However , any other plasmid or vector may 
be used as long as they are replicable and viable in the 
host. 

Promoter regions can be selected from any desired gene 
using CAT (chloran5>henicol transferase) vectors or other 
vectors with selectable markers • Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda Pr, P^ and trp. 
Eiokaryotic promoters include CMV immediate early, HSV 
thymidine kinase, early and late SV40, LTRs from retrovirus, 
and mouse metallothionein-I, Selection of the appropriate 
vector and promoter is well within the level of ordinaary 
skill in the art. 

In a further embodiment, the present invention relates 
to host cells containing the above-described constructs. The 
host cell can be a higher eukaryotic cell^ such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast 
cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the construct into the host 
cell ccui be effected by calcixm phosphate transf ection, DEAE- 
Dextran mediated transfection, or electroporation (Davis, L., 
Dibner, M., Battey, I., Basic Methods in Molecular Biology, 
(1986)) . 

The constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternatively, the polypeptides of 
the invention can be synthetically produced by^ conventional 
peptide synthesizers. 
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Mature proteins can be expressed in tnammalian cells , 
yeast, bacteria, or other cells under the control of 
appropriate promoters. Cell-free translation systems can 
also be enployed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with 
prokaryotic and etikaryotic hosts are described by.Sambrook, 
et al.. Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor, N.Y., (1989), the disclosure of 
which is hereby incorporated by reference. 

Transcription of the DNA encoding the polypeptides of 
the present invention by higher eiikaryotes is increased by 
inserting an enhancer sequence into the vector. Enhancers 
are cis -acting elements of DNA, usually about from 10 to 300 
bp that act on a promoter to increase its transcription. 
Exairples include the SV40 enhancer on the late side of the 
replication origin bp 100 to 270, a cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include 
origins of replication and selectable markers permitting 
transformation of the host cell, e.g., the an5)icillin 
resistance gene of E . coli and S , cerevisiae TRPl gene, and 
a promoter derived from a highly-expressed gene to direct 
transcription of a downstream structural sequence. Such 
promoters - can be derived from operons encoding glycolytic 
enzymes such as 3 -phosphoglycerate kinase (PGK) , of-f actor, 
acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate 
phase with translation initiation and termination sequences, 
and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or 
extracellular medium. Optionally, the heterologous sequence 
can encode a fusion protein including an N-terminal 
identification peptide iTi5)arting desired characteristics, 

-14- 



1 

e.g., stabilization or sinplified purification of expressed 
recombinant product. 

Useful expression vectors for bacterial use are 
constructed by inserting a structural DNA sequence encoding 
a desired protein together with suitable translation 
initiation and termination signals in operable reading phase 
with a functional promoter. The vector will comprise one or 
more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli . 
Bacillus subtilis . Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a 
matter of choice. 

As a representative but nonlimiting example, useful 
expression vectors for bacterial use can congprise a 
selectable marker and bacterial origin of replication derived 
from commercially available plasmids corcprising genetic 
elements of the well known cloning vector pBR322 {ATCC 
37017) . Such commercial vectors include, for exanple, 
pKK223-3 (Pharmacia Fine Oiemicals, XJppsala, Sweden) and GEMi 
(Promega Biotec, Madison, WI, USA) . These pBR322 "backbone" 
sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transf ornaation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the 
selected promoter is induced by appropriate means (e.g., 
tett5)erature shift or chemical induction) and cells are 
cultured for an additional period. 

Cells are typically harvested by centrifugation, 
disrupted by physical or chemical mecins, and the resulting 
crude extract retained for further purification* 

Microbial cells employed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw 
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cycling, sonication, mechanical disruption, or use of cell 
lysing agents, such methods are well knovm to those skilled 
in the art . 

Various mammalian cell culture systems can also be 
employed to eaqpress recombinant protein. Examples of 
mammalian expression systems include the COS -7 lines of 
monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 
(1981) , and other cell lines capable of expressing a 
compatible vector, for exaTi:?)le, the C127, 3T3, CHO, HeLa and 
BHK cell lines. Mammalian expression vectors will comprise 
an origin of replication, a suitcible promoter and enhancer, 
and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, 
transcriptional teinrnination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the 
SV40 splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. 

The polypeptide Ccua be recovered and purified from 
recombinant cell culttires by methods including ammonium 
sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature 
protein. JPinally, high perfoannance liquid chromatography 
(HPLC) can be eTt5)loyed for final purification steps. 

The polypeptides of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant techniques 
from a prokaryotic or eukaaryotic host (for exanple, by 
bacterial, yeast, higher plant, insect and mammalian cells in 
culture) . Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. 
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Polypeptides of the invention may also include an initial 
methionine amino acid residue. 

The CT6F-2 polypeptides of the present invention may be 
en?)loyed to enhance the repair of connective and support 
tissue. For exan?)le crGF-2 may be used to treat skin 
disorders such as inj\iries, acne, aging, UV damage or bums. 
CTGP-2 may also be employed, to improve the cosmetic 
appearance of the skin, for example, by treating wrinkled 
skin. 

CTGF-2 may also be employed to promote the attachment, 
fixation and stabilization of tissue in?)lants, for example, 
a- prosthesis and other implants inserted during 
reconstructive surgery. The CTGF-2 polypeptide of the 
present invention may be eii?>loyed in the healing of external 
wounds, by promoting growth of epithelial and connective 
tissues and the synthesis of total protein and collagen. 
CTGF-2 may be applied in the area of injtired or depleted 
bones , with regeneration occxirring by promoting the growth of 
connective tissue, bone and cementum and by stimulating 
protein and collagen synthesis which is especially useful for 

periodontal disease. 

This invention provides a method for identification of 
the receptor for the CTGF-2 polypeptide. The gene encoding 
the receptor can be identified by esqpression cloning. 
Briefly, polyadenylated RNA is prepared from a cell 
responsive to CT6F-2, and a cDNA library created from this 
RNA is divided into pools and used to transf ect COS cells or 
other cells that are not responsive to CTGP-2. Transf ected 
cells which are grown on glass slides are exposed to labeled 
CTGF-2. Hxb CTGF-2 can be labeled by a variety of means 
including iodidation or inclusion of a recognition site for 
a site -specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic 
analysis. Positive pools are identified and sub-pools are 
prepared and retransfected using an iterative sub-pooling and 
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rescreening process, eventually yielding a single clone that 
encodes the putative receptor. As an alternative approach 
for receptor identification, labeled ligand can be 
photoaffinity linked with cell membrane or extract 
preparations that express the receptor molecule « Cross- 
linked material is resolved by PAGE and exposed to x-ray 
film. The Icd^eled coti5>lex containing the CT6P- 2 -receptor can 
be excised, resolved into peptide fragments, and subjected to 
protein microseguencing. The amino acid sequence obtained 
from microseguencing would be used to design a set of 
generate oligonucleotide probes to screen a cDNA library to 
identify the gene encoding the putative receptor. 

This invention also provides a method of screening 
corapoixnds to identify those which bind to the crGF-2 receptor 
and elicit a second messenger response (agonists) or do not 
elicit a second messenger response (antagonists) . As an 
exaxiple, a mammalian cell or membrane preparation expressing 
the CTGF-2 receptor would be incubated with a labeled 
coitpound. The response of a known second messenger system 
following interaction of the compound and the CTGF-2 receptor 
is then measured. Such second messenger systems include but 
are not limited to, cAMP guanylate cyclase, ion channels or 
phosphoinositide hydrolysis . 

The present invention is also directed to antagonists 
molecules of the polypeptides of the present invention, cuid 
their use -in reducing or eliminating the function of CTGF-2. 

An example of an antagonist is an antibody or in some 
cases, an oligonucleotide, which binds to the CTGF-2 
polypeptide. Alternatively, antagonists include closely 
related proteins that have lost biological function and 
thereby prevent the action of CTGP-2 since receptor sites are 
occupied . 

Antisense technology may be employed to decrease the 
level of In vivo circulation of CTGF-2. Antisense technology 
can be used to control gene expression through triple -helix 
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formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For 
exattqple, the 5' coding portion of the polynucleotide 
sequence, which encodes for the mature polypeptides of the 
present invention, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. 
A DNA oligonucleotide is designed to be con^lementary to a 
region of the gene involved in transcription (triple helix - 
see Lee et al*, Nucl. Acids Res., 6:3073 (1979); Cooney et 
al. Science, 241:456 (1988); and Dervan et al., Science, 251: 
1360 (1991) ) , thereby preventing transcription and the 
production of CT6F-2* The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and bloclcs translation of the 
mRNA molecule into CT6F-2 (antisense - Okano, J. Neurochem. , 
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors 
of Gene Expression, CRC Press, Boca Raton, FL (1988)). The 
oligonucleotides described above can also be delivered to 
cells such that the antisense RNA or DNA may be expressed ixz 
vivo to inhibit production of CTGF-2. 

Another exanple of an antagonist is a small molecule 
which binds to the crGF-2 receptors such that normal 
biological activity is prevented • Examples of small 
molecules include but are not limited to small peptides or 
peptide-like molecules. 

The antagonists may be employed to prevent scar 
formation . due to excess proliferation of connective tissues 
and to prevent CT6F-2 dependent tumor growth. The 
antagonists may be eit5)loyed in a composition with a 
pharmaceutically acceptable carrier, e.g., as hereinabove 
described. 

The present invention also relates to an assay for 
identifying potential antagonists specific to CTGF-2. An 
exaiiqple of such an assay combines CT6F-2 and^a potential 
antagonist with membrane -bound CT6F-2 receptors or 
recombinant CTGF-2 xrnder appropriate conditions for a 
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coTtqpetitive inhibition assay. CTGF-2 can be labeled, such as 
by radio activity, such that the nximber of CT6F-2 molecules 
bound to the receptor can determine the effectiveness of the 
potential antagonist. 

The polypeptides and antagonists and agonists of the 
present invention may be enployed in combination with a 
suitable pharmaceutical carrier. Such coirpositions comprise 
a therapeutically effective amount of the polypeptide, 
agonist or antagonist, and a pharmaceutically acceptable 
carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, 
glycerol, ethanol, and combinations thereof. The formulation 
should suit the mode of administration. 

The invention also provides a pharmaceutical pack or kit 
con5)rising one or more containers filled with one or more of 
the ingredients of the pharmaceutical ccnnpositions of the 
invention. Associated with such container (s) can be a notice 
in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In 
addition, the polypeptides, agonists and antagonists of the 
present invention may be employed in conjunction with other 
therapeutic compounds. 

The phaxmaceutical cotcpositions may be administered in 
a convenient manner such as by the topical, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or 
intradermal routes. CT6F-2 is administered in an amount 
which is effective for treating and/or prophylaxis of the 
specific indication. In general, CTGF-2 will be administered 
in an amo\mt of at least about 10 ^g/kg body weight cuid in 
most cases crGF-2 will be administered in an amount not in 
excess of about 8 mg/Kg body weight per day. In most cases, 
the dosage is from about 10 ^tg/kg to about 1 mg/kg body 
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weight daily, taking into accoiint the routes of 
administration , syii^toms , etc * 

The polypeptides tnay also be employed in accordance with 
the present invention by expression of such polypeptides in 
vivo, which is often referred to as "gene therapy, " 

Thus, for exanqple, cells from a patient may be 
engineered with a polynucleotide (DNA or RNA) encoding a 
polypeptide ex vivo, with the engineered cells then being 
provided to a patient to be treated with the polypeptide* 
Such methods are well-known in the art. For exatiqple, cells 
may be engineered by procedures known in the art by use of a 
retroviral particle containing KN& encoding a polypeptide of 
the present invention. 

Similarly, cells may be engineered in vivo for 
expression of a polypeptide in vivo by, for exanqple, 
procedures known in the art. As known in the art, a producer 
cell for producing a retroviral particle containing RNA 
encoding the polypeptide of the present invention may be 
administered to a patient for engineering cells in vivo and 
expression of the polypeptide in vivo. These and other 
methods for administering a polypeptide of the present 
invention by such method should be apparent to those skilled 
in the art from the teachings of the present invention. For 
exaxnple, the expression vehicle for engineering cells inay be 
other than a retrovirus, for exan^le, an adenovirus which may 
be used to engineer cells in vivo after combination with a 
suitable delivery vehicle. 

Retroviruses from which the retroviral plasmid vectors 
hereinabove mentioned may be derived include, but are not 
limited to, Moloney Murine Leukemia Virus, spleen necrosis 
virus, retroviruses such as Rous Sarcoma virus, Harvey 
Sarcoma Virus, avian leukosis virus, gibbon ape lexikemia 
virus, himian immunodeficiency virus, adenovirus. 
Myeloproliferative Sarcoma Virus, and mammary tumor virus. 
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In one embodiment, the retroviral plasmid vector is derived 
from Moloney Murine Lexikemia Virus. 

The vector includes one or more promoters. Suitable 
promoters which may be enqployed include, but are not limited 
to, the retroviral LTR; the SV40 promoter; and the human 
cytomegalovirus (CMV) promoter described in Miller, et al., 
Biotechnicnies . Vol. 7, No. 9, 980-990 (1989), or any other 
promoter (e*g., cellular promoters such as exikaryotic 
cellular promoters including, but not limited to, . the 
histone, pol III, cuid j8-actin promoters) . Other viral 
promoters which may be employed include, but are not limited 
to, adenovirus promoters, thymidine kinase (TK) promoters, 
and B19 parvovirus promoters* The selection of a suitable 
promoter will be apparent to those skilled in the art from 
the teachings contained herein. 

The nucleic acid sequence encoding the polypeptide of 
the present invention is under the control of a suitable 
promoter. Suitable promoters which may be eTt5)loyed include, 
but are not limited to, adenoviral promoters, such as the 
adenoviral major late promoter; or heterologous promoters, 
such as the cytomegalovirus (CMV) promoter; the respiratory 
syncytial virus (RSV) promoter; inducible promoters, such as 
the MMT promoter, the metallothionein promoter; heat shock 
promoters; the albxxmin promoter; the ApoAI promoter; hi:mian 
globin promoters; viral thymidine kinase promoters, such as 
the Herpes Sitiqplex thymidine kinase promoter; retroviral LTRs 
(including the modified retroviral LTRs hereinabove 
described) ; the (3 -act in promoter; and human growth hormone 
promoters. The promoter also may be the native promoter 
which controls the gene encoding the polypeptide . The 
retroviral plasmid vector is enqoloyed to transduce packaging 
cell lines to form producer cell lines. Exatrqples of 
packaging cells which may be transfected include, but are not 
limited to, the PE501, PA317, ^-2, ^^-AM, PA12, T19-14X, VT- 
19-17-H2, ^^CRE, ^^CRIP, GP+E-86, 6P+envAml2, and DAN cell 
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lines as described in Miller, Human Gene Therapy . Vol. 1, 
pgs. 5-14 (1990), which is incorporated herein by reference 
in its entirety. The vector may transduce the packaging 
cells through any means known in the art. Such means 
include, but are not limited to, electroporation, the use of 
liposomes, and CaP04 precipitation. In one alternative, the 
retroviral plasmid vector may be encapsulated into a 
liposome, or coupled to a lipid, emd then administered to a 
host. 

The producer cell line generates inf ectioixs retroviral 
vector particles which include the nucleic acid sequence (s) 
encoding the polypeptides. Such retroviral vector particles 
then may be enployed, to transduce eukaryotic cells, either 
in vitro or in vivo. The transduced eukaryotic cells will 
express the nucleic acid sequence (s) encoding the 
polypeptide. Eukaryotic cells which may be transduced 
include, but are not limited to, embryonic stem cells, 
embryonic carcinoma cells, as well as hematopoietic stem 
cells, hepatocytes, fibroblasts, myoblasts, keratinocytes , 
endothelial cells, and bronchial epithelial cells. 

This invention is also related to the use of the gene of 
the present invention as a diagnostic. Detection of a 
mutated form of the gene will allow a diagnosis of a disease 
or a susceptibility to a disease which results from 
imderexpression of CTGP-2. 

Individuals carrying mutations in the gene of the 
present invention may be detected at the DNA level by a 
variety of techniques. Nucleic acids for diagnosis may be 
obtadLned from a patient ' s cells , including but not limited to 
blood, urine, saliva, tissue biopsy and autopsy material. 
The genomic DNA may be used directly for detection or may be 
atcplified enzymatically by using PGR (Saiki et al.. Nature, 
324:163-166 (1986)) prior to analysis. RNA or cDNA may also 
be used for the same purpose* As an example, PGR primers 
coirplementary to the nucleic acid encoding CTGF-2 can be used 



-23 



J 

7 

to identify and analyze mutations. For exatrple, deletions 
and insertions can be detected by a change in size of the 
atr^ylified product in cotrparison to the normal genotype. 
Point mutations can be identified by hybridizing atiplified 
DNA to radiolabeled RNA or alternatively, radiolabeled 
antisense D1C\ sequences* Perfectly matched sequences can be 
distinguished from mismatched duplexes by RNase A digestion 
or by differences in melting tenqperatures . 

Sequence differences between the reference gene and 
genes having mutations may be revealed by the direct DNA 
sequencing method. In addition, cloned DNA segments may be 
employed as probes to detect specific DNA segments. The 

♦ 

sensitivity of this method is greatly enhanced when combined 
with PCR. For example, a sequencing primer is used with 
donble-stranded PCR product or a single -stranded tetnplate 
molecule generated by a modified PCR. The sequence 
determination is performed by conventional procedures with 
radiolabeled nucleotide or by automatic sequencing procedures 
with fluorescent -tags. 

Genetic testing based on DNA secjuence dif f isrences may be 
achieved by detection of alteration in electrophoretic 
mobility of DNA fragments in gels with or without denaturing 
agents. Small sequence deletions and insertions can be 
visualized by high resolution gel electrophoresis. , DNA 
fragments of different sequences may be distinguished on 
denaturing f ormamide gradient gels in which the mobilities of 
different DNA fragments are retarded in the gel at different 
positions according to their specific melting or partial 
melting ten5>eratures (see, e.g., Myers et al». Science, 

230:1242 (1985)). 

Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such as RNase and SI 
protection or the chemical cleavage method (e.g.. Cotton et 

al., PNAS, USA, 85:4397-4401 (1985)). 
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Thus , the detection of a specific DNA sequence may be 
achieved by methods such as hybridization, SNase protection, 
chemical cleavage, direct DNA sequencing or the use of 
restriction enzymes, (e.g*, Restriction Fragment Length 
Polymo3::phisms (RFLP) ) and Southern blotting of genomic DNA.. 

in addition to more conventional gel-electrophoresis and 
DNA sequencing, mutations can also be detected by in situ 
analysis . 

The present invention also relates to a diagnostic assay 
for detecting altered levels of the polypeptide of the 
present invention in various tissues since an over -express ion 
of the proteins compared to normal control tissue samples can 
detect the presence of disorders of the host. Assays used to 
detect levels of the polypeptide of the present invention in 
a sai^ple derived from a host are well-known to those of skill 
in the art and include radioimmunoassays, cottpetitive-binding 
assays, Western Blot analysis and preferably an ELISA assay. 
An ELISA assay initially conprises preparing an antibody 
specific to the Cr6F-2 antigen, preferably a monoclonal 
antibody. In addition a reporter antibody is prepared 
against the monoclonal antibody. To the reporter antibody is 
attached a detectable reagent such as radioactivity, 
fluorescence or in this exaiiple a horseradish peroxidase 
enzyme. A sample is now removed from a host and inciibated on 
a solid support, e.g. a polystyrene dish, that binds the 
proteins in the sanple. Any free protein binding sites on 
the dish are then covered by incubating with a non-specific 
protein such as bovine serum albumin. Next, the monoclonal 
antibody is incubated in the dish during which time the 
monoclonal antibodies attached to the polypeptide of the 
present invention attached to the polystyrene dish. All 
unbound monoclonal antibody is washed out with buffer. The 
reporter antibody linked to horseradish peroxidase is now 
placed in the dish resulting in binding of the reporter 
antibody to any monoclonal antibody bound to the polypeptide 
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of the present invention. Unattached reporter antibody is 
then washed out. Peroxidase siibstrates are then added to the 
dish and the amount of color developed in a given time period 
is a measurement of the amoimt of the polypeptide of the 
present invention present in a given volume of patient satcple 
when coizpared against a standard cuorve. 

A conqpetition assay may be employed wherein antibodies 
specific to the polypeptide of the present invention are 
attached to a solid support and labeled CTGF-2 and a sanqple 
derived from the host are passed over the solid support and 
the amount of label detected attached to the solid support 
can be correlated to a quantity of the polypeptide of the 
present invention in the sample* 

The sequences of the present invention are also valuable 
for chromosome identification. The sequence is specifically 
targeted to and can hybridize with a particular location on 
an individual human chromosome. Moreover, there is a current 
need for identifying particular sites on the chromosome. Few 
chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking 
chromosomal location « The mapping of DNA.S to chromosomes 
according to the present invention is an important first step 
in correlating those sequences with genes associated with 
disease. 

Briefly, sequences Ccui be mapped to chromosomes by 
preparing. PGR primers (preferably 15-25 bp) from the cDNA. 
Computer analysis of the 3' iintranslated region of the gene 
is used to rapidly select primers that do not spam more than 
one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PGR 
screening of somatic cell hybrids containing individxial human 
chromosomes. Only those hybrids containing the himian gene 
corresponding to the primer will yield an attqplified fragment. 

PGR mapping of somatic cell hybrids is a rapid procedure 
for assigning a particular BNA to a particular chromosome. 
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Using the present invention with the same oligonucleotide 
primers, sublocalization can be achieved with panels of 
fragments from specific chromosomes or pools of large genomic 
clones in an analogous manner. Other mapping strategies that 
can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow- sorted 
chromosomes and preselection by hybridization to construct 
chromosome specif ic-cDNA libraries. 

Fluorescence in situ hybridization (PISH) of a cDNA 
clone to a metaphase chromosomal spread can be used to 
provide a precise chromosomal location in one step. This 
technique can be used with CDNA having at least 50 or 60 
bases. For a review of this technique, see Verma et al., 
H\xman Qiromosomes: a Manual of Basic Techniques, Pergamon 
Press, New York (1988). 

Once a sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such 
data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library) . The relationship between 
genes and diseases that have been mapped to the same 
chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes) . 

Next, it is necessary to determine the differences in 
the cDNA or genomic sequence between affected and unaffected 
individuals. If a mutation is observed in some or all of the 
affected individuals but not in any noirmal individuals, then 
the mutation is likely to be the causative agent of the 
disease . 

With cxxrrent resolution of physical mapping and genetic 
mapping techniques, a cDNA precisely localized to a 
chromosomal region associated with the disease could be one 
of between 50 and 500 potential causative genes. (This 
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assumes 1 megabase mapping resolution and one gene per 20 
kb) . 

The polypeptides, their fragments or other derivatives, 
or analogs thereof, or cells expressing them can be used as 
an immunogen to produce antibodies thereto* These antibodies 
can be, for exanple, polyclonal or monoclonal antibodies. 
The present invention also includes chimeric, single chain, 
and hi:ananized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures 
known in the art may be used for the production of such 
antibodies and fragments* 

Antibodies generated against the polypeptides 
corresponding to a sequence of the present invention can be 
obtained by direct injection of the polypeptides into an 
animal or by administering the polypeptides to an animal, 
preferably a nonhuman. The antibody so obtained will then 
bind the polypeptides itself. In this manner, even a 
sequence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native 
polypeptides* Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique 
which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, 1975, Nature, 256:495-497), 
the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies 
(Cole, et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain 
antibodies (U*S. Patent 4,946,778) can be adapted to produce 
single chain antibodies to immunogenic polypeptide products 
of this invention. Also, transgenic mice may be used to 
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express humanized antibodies to immunogenic polypeptide 
products of this invention. 

The present invention will be further described with 
reference to the following exattples; however, it is to be 
understood that the present invention is not limited to such 
exanples. All parts or amounts, unless otherwise specified, 
are by weight. 

In order to facilitate understanding of the following 
exaitples certain frequently occurring methods and/ or terms 
will be described. 

"Plasmids" are designated by a lower case p preceded 
and/or followed by capital letters and/or numbers. The 
starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with published 
procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the 
ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the 
DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used 
herein are commercially available and their reaction 
conditions, cof actors and other requirements were used as 
would be known to the ordinarily skilled artisan. For 
analytical purposes, typically 1 ng of plasmid or DNA 
fragment is used with about 2 imits of enzyme in about 20 /il 
of buffer solution. For the purpose of isolating DNA 
fragments for plasmid const3aiction, typically 5 to 50 /zg of 
DNA are digested with 20 to 250 lanits of enzyme in a larger 
volume. Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37 'C are 
ordinarily used, but may vary in accordance with the 
supplier's instructions. After digestion the reaction is 
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electrophoresed directly on a polyacrylamide gel to isolate 
the desired fragment. 

Size separation of the cleaved fragments is performed 
using 6 percent polyacrylamide gel described by Goeddel, D. 
et al., Nucleic Acids Res., 8:4057 (1980). 

"Oligonucleotides" refers to either a single stranded 
polydeoxynucleotide or two conplementary polydeoxynucleotide 
strands which may be chemically synthesized. Such synthetic 
oligonucleotides have no 5' phosphate and thus will not 
ligate to another oligonucleotide without adding a phosphate 
with cm ATP in the presence of a kinase. A synthetic 
aligonucleotide will ligate to a fragment that has not been 
dephosphorylated . 

"Ligation" refers to the process of forming 
phosphodi ester bonds between two double stranded nucleic acid 
fragments (Maniatis, T., et al., Id., p. 146). Unless 
otherwise provided, ligation may be accomplished using known 
buffers and conditions with 10 units to T4 DNA ligase 
("ligase") per 0.5 /ig of approximately equimolar amoxints of 
the HNA fragments to be ligated. 

Unless otherwise stated, transformation was performed as 
described in the method of Graham, F. and Van der Eb, A., 

Virology, 52:456-457 (1973). 

Example 1 

Cloning and expression of crGF-2 in a baculovims expression 
system 

The DNA sequence encoding the full length CT6F-2 protein, 
ATCC # 75804, was art5)lified using PCR oligonucleotide primers 
corresponding to the 5' and 3' sequences of the gene: 
The 5' primer has the sequence 06CX3G6ATCCTGC6CGACftCAATGA6CT 
(SEQ ID NO: 3) and contains a BamHI restriction enzyme site 
(in bold) followed by 18 nucleotides resembling an efficient 
signal for the initiation of translation in eukaryotic cells 
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(J. Mol. Biol. 1987, 19i# 947-950, Kozak, M J . The 
initiation codon for tremslation is underlined. 

The 3' primer has the sequence 
aSCGGGTACCAGGTAGCATTTAGTCCCTAA {SEQ ID NO: 4) and contains 
the cleavage site for the restriction endonuclease Asp781 and 
20 nucleotides complementary to the 3' non- translated 
sequence of the CTGF-2 gene. The amplified sequences were 
isolated from a 1% agarose gel using a commercially availcible 
kit { "Geneclean" , BIO 101 Inc., La Jolla, Ca.). The fragment 
was then digested with the endonucleases BamHI and Asp781 and 
then pxurif ied by isolation on a 1% agarose gel . This 
fragment is designated F2. 

The vector pRGl (modification of pVL941 vector, discussed 
below) is used for the expression of the creF-2 protein using 
the baculovirus expression system (for review see: Summers, 
M.D. and Smith, 1987, A manual of methods for 

baculovirus vectors and insect cell culture procedures, Texas 
Agricultural Experimental Station Bulletin No. 1555) . This 
expression vector contains the strong polyhedrin promoter of 
the Autographa califomica nuclear polyhedrosis virus 
(AcMNPV) followed by the recognition sites for the 
restriction endonucleases BamHI and Asp781. The 
polyadenylation site of the simian virus (SV)40 is used for 
efficient polyadenylation. For an easy selection of 
recombinant viruses the beta-galactosidase gene from E.coli 
is inserted in the same orientation as the polyhedrin 
promoter followed by the polyadenylation signal of the 
polyhedrin gene. The polyhedrin sequences are flanked at 
both sides by viral sequences for the cell -mediated 
homologous recombination of cotransf ected wild-type viral 
DNA. Many other baculovirus vectors could be used in place 
of pRGl such as pAc373, pVL941 and pAcIMl (Luckow, V.A. and 
Summers, M.D., Virology, 170:31-39). ^ 

The plasmid was digested with the restriction enzymes 
BamHI and Asp781 and then dephosphorylated using calf 
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intestinal phosphatase by procedures known in the art. The 
DNA was then isolated from a 1% agarose gel. This vector DNA 
is designated V2. 

Fragment F2 and the dephosphorylated plasmid V2 were 
ligated with T4 DNA ligase. E.coli HBlOl cells were then 
trcuisformed and bacteria identified that contained the 
plasmid {pBacCrGF-2) with the CT6F-2 gene using the enzymes 
BamHl and Asp781. The sequence of the cloned fragment was 
confirmed by DNA sequencing. 

5 iig of the plasmid pBacCTGF-2 were cotransfected with 
1.0 /xg of a commercially available linearized baculovirus 
{■"BaculoGold"^ baculovirus DNA", Pharmingen, San Diego, CA.) 
using the lipofection method (Feigner et al. Proc. Natl. 
Acad. Sci. USA, 84:7413-7417 (1987)). 

Ifig of BaculoGold™ virus DNA and 5 fig of the plasmid 
pBacCTGF-2 were mixed in a sterile well of a microtiter plate 
containing 50 /il of serum free Grace's medium (Life 
Technologies Inc., Gaithersburg, MD) . Afterwards 10 /zl 
Lipofectin plus 90 fil Grace's medium were added, mixed and 
incubated for 15 minutes at room temperature. Then the 
transfection mixture was added dropwise to the Sf9 insect 
cells (ATCC CRL 1711) seeded in a 35 mm tissue culture plate 
with iml Grace' medium without serum. The plate was rocked 
back and forth to mix the newly added solution. The plate 
was then incubated for 5 hoxirs at 27«*C. After 5 hours the 
transfection solution was removed from the plate and l ml of 
Grace's insect mediiam supplemented with 10% fetal calf serum 
was added. The plate was put back into an incubator and 
cultivation continued at 27**C for four days. 

After four days the supernatant was collected and a 
plaque assay performed similar as described by Siimmers and 
Smith (supra) . As a modification an agarose gel with "Blue 
Gal" (Life Technologies Inc., Gaithersburg) was used which 
allows an easy isolation of blue stained plac[ues. (A 
detailed description of a "plaque assay" can also be found in 
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the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc., Gaithersburg, page 9* 
10) . 

Fotir days after the serial dilution of the viruses was 
added to the cells, blue stained plagues were picked with the 
tip of an Eppendorf pipette. The agar containing the 
recombinant viruses was then resuspended in an Sppendorf tube 
containing 200 fil of Grace's medium. The agar was removed by 
a brief centrifugation and the supesmatant containing the 
recombinant baculoviruses was used to insect Sf 9 cells seeded 
in 35 mm dishes. Four days later the supernatant s of these 
culture dishes were harvested and then stored at 4**C. 

Sf9 cells were grown in Grace's medium supplemented with 
10% heat -inactivated FBS. The cells were infected with the 
recombinant baculovirus V-CTGF-2 at a multiplicity of 
infection (MOD of 2. Six hours later the mediimi was removed 
and replaced with SP900 II medium minus methionine and 
cysteine (Life Technologies Inc . , Gaithersburg) . 42 hours 
later 5 fiCi of ^^S -methionine and 5 fiC± cysteine (Amersham) 
were added. The cells were further incubated for 16 hours 
before they were harvested by centrifugation and the labelled 
proteins visualized by SDS-PAGE and autoradiography. 

Example 2 

Expression of Recombinant CTGF-2 in COS cells 

The expression of plasmid, CTGF-2 HA is derived from a 
vector pcDNAI/Amp (Invitrogen) containing: 1) SV40 origin of 
replication, 2) atr^icillin resistance gene, 3) E.coli 
replication origin, 4) CMV promoter followed by a poly linker 
region, a SV40 intron and polyadenylation site. A DNA 
fragment encoding the entire CT6F-2 precursor and a HA tag 
fused in frame to its 3' end was cloned into the polylinker 
region of the vector, therefore, the recombinant protein 
expression is directed under the CMV promoter. The HA tag 
correspond to an epitope derived from the influenza 
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hemagglutinin protein as previously described {I. Wilson, H, 
Niman, Heighten, A Qierenson, Connolly, and R, Lemer, 
1984, Cell 37, 767) • The infusion of HA tag to our target 
protein allows easy detection of the recombinant protein with 
an antibody that recognizes the HA epitope. 

The plasmid construction strategy is described as 
follows : 

The DNA sequence encoding for CT6F-2, ATCC # 75804, was 
constructed by PCR on the full --length clone using two 
primers: the 5' primer 5' AAAGGATCCACAATGAGCTCCCGAATC (SEQ 
ID NO: 5) 3' contains a Bam HI site followed by 18 
nucleotides of CT6P-2 coding sequence starting from the -3 
position relative to the initiation codon; the 3' sequence 5' 
axrrcrAGATTAAGGGTAGTCTGGGAaxrC^ 5' 
(SEQ ID NO: 6) contains complementary sequences to an Xba I 
site, translation stop codon, HA tag and the last 19 
nucleotides of the CTGF-2 coding sequence (not including the 
stop codon) . Therefore, the PCR product contains a Bam HI 
site, CTGF-2 coding sequence followed by an HA tag fused in 
frame, a translation termination stop codon next to the HA 
tag, and an Xba I site. The PCR amplified DNA fragment and 
the vector, pcDNAI/Amp, were digested with Bam HI and Xba I 
restriction enzymes and ligated. The ligation mixture was 
trcmsformed into E. coli strain SURE, {available from 
Stratagene Cloning Systems, 11099 North Torrey Pines Road, La 
Jolla, CA. 92037) the transformed culture was plated on 
ampicillin media plates and resistant colonies were selected. 
Plasmid DNA was isolated from transf ormants and examined by 
restriction analysis for the presence of the correct 
fragment. For expression of the recombinant CTGP-2, COS 
cells were transfected with the expression vector by DBAE- 
DEXTRAN method. (J« Sambrook, E. Fritsch, T. Maniatis, 
Molecular Cloning: A Laboratory Manual, Cold Spring 
Laboratory Press, (1989)). The expression of the CrGF-2 HA 
protein was detected by radio label ling and 
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iinrmmoprecipitation method. (E. Harlow, D. Lane, Antibodies: 
A Laboratory Manual, Cold Spring Harbor Laboratory Press, 

(1988) ) . Cells were labelled for 8 hoxirs with ^*S-cysteine 
two days post transfection. Culture media were then 
collected and cells were lysed with detergent (RIPA buffer 

(150 TCM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, SOrtiM 
Tris, pH 7.5). (Wilson, I. et al.. Id. 37:767 (1984)). Both 
cell lysate and culture media were precipitated with a HA 
specific monoclonal antibody. Proteins precipitated were 
analyzed on 15% SDS -PAGE gels. 

Example 3 
Expression via Gene Therapy 

Fibroblasts are obtained from a sxibject by skin biopsy. 
The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue are 
placed on a wet surface of a tissue culture flask, 
approximately ten pieces are placed in each flask. The flask 
is turned upside down, closed tight and left at room 
ten5)erature over night. After 24 hours at room temperatizre, 
the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g., Ham's P12 
media, with 10% FBS, penicillin and streptomycin, is added. 
This is then incubated at 37**C for appro3cimately one week. 
At this time, fresh media is added and subsequently changed 
every several days. After an additional two weeks in 
culture, a monolayer of fibroblasts emerge. The monolayer is 
trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al, DNA, 7:219-25 (1988) 

flanked by the long terminal repeats of the Moloney murine 
sarcoma virus, is digested with EcoRI and Hindlll and 
subsequently treated with calf intestinal phosphatase. The 
linear vector is fractionated on agarose gel cuid purified, 
using glass beads. 
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The cDNA encoding a polypeptide of the present invention 
is attplif ied using PGR primers which correspond to the 5 ' and 
3' end sequences respectively • The 5' primer containing an 
EcoRI site and the 3' primer further includes a Hindlll site. 
Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the anqolif ied EcoRI and Hindlll fragment are 
added together, in the presence of T4 DNA. ligase. The 
resulting mixture is maintained under conditions appropriate 
for ligation of the two fragments. The ligation mixture is 
used to transform bacteria HBlOl, which are then plated onto 
agar-containing kanattQrcin for the purpose of confirming that 
the vector had the gene of interest properly inserted. 

The ait5)hotropic pA3l7 or GP+aml2 packaging cells are 
grown in tissue culture to confluent density in Dulbecco's 
Modified Eagles Medium (DHEM) with 10% calf senrni (CS) , 
penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells are 
transduced with the vector. The packaging cells now produce 
infectious viral particles containing the gene (the packaging 
cells are now referred to as producer cells) . 

Fresh media is added to the transduced producer cells, 
and subsequently, the media is harvested from a 10 cm plate 
of confluent producer cells. The spent media, containing the 
infectious viral particles, is filtered through a millipore 
filter to remove detached producer cells and this media is 
then used -to infect fibroblast cells. Media is removed from 
a siib-confluent plate of fibroblasts and quickly replaced 
with the media from the producer cells. This media is 
removed and replaced with fresh media. If the titer of virus 
is high, then virtually all fibroblasts will be infected and 
no selection is required. If the titer is very low, then it 
is necessaary to use a retroviral vector that has a selectable 
marker, such as neo or his . 

The engineered fibroblasts are then injected into the 
host, either alone or after having been grown to confluence 
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on cytodex 3 microcarrier beads . The fibroblasts now produce 
the protein product. 

Niamerous modifications and variations of the present 
invention are possible in light of the above teachings and, 
therefore, within the scope of the appended claiios, the 
invention may be practiced otheirwise than as particularly 
described • 



SEQUENCE LISTING 



(1) GENERAL INFOHMATION: 

(i) APPLICS^: LI, ET AL. 



(ii) 



TITLE OF INVENTION: 



Connective Tissue Growth Pactor-2 



(iii) 



NDMBBR OF SEQUENCES; 



2 



(iv) 



CORRESPONDENCE ADDRESS: 



(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 
{C} CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 
(P) ZIP: 07068 

(v) CXMPTJTBR READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 . 

(vi) CDRRENT APPLICATION DATA: 



(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 



(vii) 



PRIOR APPLICATION DATA 



(A) APPLICATION NUMBER: PCT/US94/07736 

(B) FILING DATE: 12 JUL 94 
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(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: FERHARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE/DOCKET NDMBER: 325800-317 

(ix) TELECX3MMDNICATIQN INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID N0:1: 

(i)- SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1128 BASE PAIRS 

I* 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATGAGCTCCC GAATCGTCAG GGAGCTCGCC TTAGTCGTCA CCCTTCTCCA CTTGACCAGG 
GTGGGGCTCr CCACCTGCCC CGCTGACTGC CACTGCCCCC TGGAGGC6CC CAAGTGCGCG 
CCG6GAGTCG GQCTGGTCCG GGACGGCTGC GGCTOTTQTA AGGTCT6C6C CAAGCAGCTC 
AACGAGGACrr GCAGAAAAAC GCAGCCCTGC GACCACACCA AGGGGCTGGA ATGCAACTTC 
GGCGCCAGCr CCACCGCTCT GAAGGGGATC TGCAGAGCTC AGTCAGAGGG CAGACCCTGT 
GAATATAACT CCAGAATCTA CCAAAACGGG GAAAGTTTCC AGCCCAACXG TAAACATCAG 
TGCACATGTA TTGGATGGCG CCGGGGGGCT TGCATTCCTC TGTGTCCCCA AjGAACTATCT 
CCCCCCAACr TGGGCTGTCC CAACCCTCGG CTGGTCAAAG TTACCGGGCA GTGCTGCGAG 
GAGTGGGTCr GTGACGAGGA TAGTATCA2VG GACCCCATGG AGGACCAGGA CGGCCTCCTT 
GGCAAGGGGC 7GGGATTCGA TGCCTCCGAG GTGGAGTTGA CGAGA2UICAA TGAATTGATT 
GCAOTTGGAA AAGGCAGCTC ACTGAAGCGG CTCCCrGTTT TTGGAATGGA GCCTCGCATC 
CTATACAACC CETrACAAGG CCAGAAATGT ATTGTTCAAA CAACTTCATG GTCCCAGTGC 
TCAAAGACCr GTGGAACTGG TATCTCCACA CGAGTTACCA ATGACAACCC TGAGTGCCGC 
CTTGTGAAAG AAACCCGGAT TTGTGAGGTG CGGCCTTGTG GACAGCCAGT GTACAGCAGC 
CTGAAAAAGG GCAAGAAATG CAGCAAGACC AAGAAATCCC CCGAACCAGT CAGGTTTACT 
TACGCTGGAT GTTTGAGTGT GAAGAAATAC CGGCCCAAGT ACTGCGGTTC CTGCGTGGAC 
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GGCCC3ATGCT GCACGCCCCA GCTGACCAGG ACTGTGAAGA TGCGGTTCCC CTGCGAAGAT 1020 
GGGGAGACAT TTTCCAAGAA CGTCATGATG ATCCAGTCCT CCAAATGCaA CTACAACTGC 1080 
CCGCATGCCA ARGAAGCAGC GTrTCCCTTC TAC3U3GCTGT TCCAftTGA 1128 

(2) INFORMATION FOR SEQ ID NO: 2: 
(1) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 375 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS t 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ser Ser Arg He Val Arg Glu Leu Ala Leu Val Val Thr Leu 

-20 -15 -10 

Leu His Leu Thr Arg Val Gly Leu Ser Thr Cys Pro Ala Asp Cys 

-5 15 
His Cys Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu 

10 15 20 

Val Arg Asp Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu 

25 30 35 

Asn Glu Asp Cys Arg Lys Thr Gin Pro Q^s Asp His Thr Lys Gly 

40 45 ' 50 

Leu Glu Cys Asn Phe Gly Ala Ser Ser Thr Ala Leu Lys Gly He 

55 60 65 

Cys Arg Ala Gin Ser Glu Gly Arg Pro Cys Glu Tyr Asn Ser Arg 

70 75 80 

He Tyr Gin Asn Gly Glu Ser Phe Gin Pro Asn Cys Lys His Gin 

85 90 95 

Cys Thr Cys He Gly Thp Arg Arg Gly Ala Cys He Pro Leu Cys 

100 105 110 

Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly Cys Pro Asn Pro Arg 

120 125 
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Leu Val Lys 
61u Asp Ser 
Gly Lys 61y 
Asn Asn 61u 
Leu Pro Val 
Gin Gly Gin 

* 

Ser Lys Thr 
Asn Pro Glu 
Arg Pro Cys 
Lys Cys Ser 
Tyr Ala Gly 
Gly Ser Cys 
Thr Val Lys 
Lys Asn Val 
Pro His Ala 



Val Thr Gly 
130 

lie Lys Asp 
145 

Leu Gly Phe 
160 

Leu lie Ala 
175 

Phe Gly Met 
190 

Lys Cys lie 

205 

Cys Gly Thr 
220 

Cys Arg Leu 
235 

Gly Gin Pro 
250 

Lys Thr Lys 
265 

Cys Leu Ser 
280 

Val Asp Gly 
295 

Met Arg Phe 
310 

Met Met lie 
325 

Asn Glu Ala 
340 



Gin Cys Cys 

135 

Pro Met Glu 

150 

Asp Ala Ser 

165 

Val Gly Lys 

180 

Glu Pro Arg 

195 

Val Gin Thr 

210 

Gly lie Ser 

225 

Val Lys Glu 

240 

Val Tyr Ser 

255 

Lys Ser Pro 

270 

Val Lys Lys 

285 

Arg Cys Cys 

300 

Pro Cys Glu 

315 

Gin Ser Ser 

330 

Ala Phe Pro 

345 



Glu Glu Trp 
Asp Gin Asp 
Glu Val Glu 
Gly Ser Ser 
lie Leu Tyr 
Thr Ser Trp 
Thr Arg Val 
Thr Arg lie 
Ser Leu Lys 
Glu Pro Val 
Tyr Arg Pro 
Thr Pro Gin 
Asp Gly Glu 
Lys Cys Asn 
Phe Tyr Arg 



Val C^s Asp 

140 
Gly Leu Leu 

155 
Leu Thr Arg 

170 
Leu Lys Arg 

185 
Asn Pro Leu 

200 
Ser Gin Cys 

215 
Thr Asn Asp 

230 
Cys Gly Val 

245 
Lys Gly Lys 

260 
Arg Phe Thr 

275 
Lys Tyr Cys 

290 
Leu Thr Arg 

305 
Thr Phe Ser 

320 
Tyr Asn Cys 

335 
Leu Phe Gin 

350 
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WHAT IS CIAIMBD IS : 

1* An isolated polynucleotide coirprising a member selected from the 
group consisting of: 

(a) a polynucleotide encoding the polypeptide conprising amino acid 
-24 to amino acid 351 as set forth in SEQ ID N0:2; 

(b) a polynucleotide encoding the polypeptide cottprising amino acid 
1 to amino acid 351 as set forth in SEQ ID NO: 2 

(c) a polynucleotide capable of hybridizing to and which is at 
least 70% identical to the polynucleotide of (a) or (b) ; and 

(d) a polynucleotide fragment of the polynucleotide of (a) , (b) or 
(c). 



2* The polynucleotide of Claim 1 wheredLn the polynucleotide is DNA. 

3. The polynucleotide of Claim 2 which encodes the polypeptide 
comprising amino acid 1 to 351 of SEQ ID NO: 2. 

4. An isolated polynucleotide cottprising a member selected from the 
group consisting of: 

(a) a polynucleotide which encodes a mature polypeptide having the 
amino acid sequence expressed by the DNA contained in ATCC Deposit No. 

75804; 

(b) a polynucleotide which encodes a polypeptide having the amino 
acid sequence expressed by the DNA contained in ATCC Deposit No • 

75804; 

(c) a polynucleotide capable of hybridizing to and which is at 
least 70% identical to the polynucleotide of (a) ; and 

(d) a polynucleotide fragment of the polynucleotide of (a) , (b) or 
(c) . 

5. The polynucleotide of claim 1 comprising the sequence as set 
forth in SEQ ID No, 1 from nucleotide 1 to nucleotide 1128. 



6. The polynucleotide of claim 1 comprising the sequence as set 
forth in SEQ ID No. 1 from nucleotide 73 to nucleotide 1128. 
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7. A vector containing the DNA of Claim 2. 

8. A host cell genetically engineered with the vector of Claim ?• 

9. A process for producing a polypeptide comprising: expressing from 
the host cell of Claim 8 the polypeptide encoded by said DNA. 

10. A process for producing cells capable of expressing a polypeptide 
comprising genetically engineering cells with the vector of Claim 7. 

11. A polypeptide encoded by the polynucleotide of claim 1 comprising 
a member selected from the group consisting of (i) a mature 
polypeptide having the deduced amino acid sec[uence of SEQ ID NO: 2 and 
fragments, analogs and derivatives thereof; and (ii) a mature 
polypeptide encoded by the cDNA of ATCC Deposit No. 75804 and 
fragments, analogs and derivatives of said polypeptide. 

12. The polypeptide of Claim 11 wherein the polypeptide comprises 
amino acid 1 to amino acid 351 of SEQ ID NO: 2. 

13. A compound which inhibits activation of the receptor for the 
polypeptide of claim 11. 

14. A compound which activates the receptor for the polypeptide of 
claim 11. 

15. A method for the treatment of a patient having need of CTGP-2 
con5)rising: administering to the patient a therapeutically effective 
amount of the polypeptide of claim 11. 

16. The method of Claim 15 wherein said therapeutically effective 
amount of the polypeptide is administered by providing to the patient 
DNA encoding said polypeptide and expressing said polypeptide in vivo. 



43- 



1 



17. A method for the treatment of a patient having need to inhibit a 
crGF-2 polypeptide comprising: administering to the patient a 
therapeutically effective amount of the compound of Claim 13. 

18. A process for diagnosing a disease or a susceptibility to a 
disease related to an under-eapression of the polypeptide of claim 11 
con^rising; 

determining a mutation in a nucleic acid sequence encoding said 
polypeptide . 

19. A diagnostic process comprising: 

analyzing for the presence of the polypeptide of claim 11 in a 
sample derived from a host. 

20 . A method for identifying agonist or antagonist compounds to the 
polypeptide of claim 11 comprising: 

contacting a cell e^ressing on the surface thereof a receptor for 
the polypeptide, said receptor being associated with a second 
con?>onent capable of providing a detectable signal in response to the 
binding of a coii?)ound to said receptor, with an analytically 
detectable compound under conditions to permit binding to the 
receptor; 

detecting the absence or presence of a signal generated from the 
interaction of the compound with the receptor, 
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ABSTRACT OF TBE DZSCLOSUSE 

The present invention relates to a human CrGP-2 polypeptide and DNA 
(KHA) encoding such polypeptide. Also provided is a procediire for 
producing such polypeptide by recombinant techniques and antibodies 
and antagonist/inhibitors against such polypeptide. Also provided are 
methods of using the polypeptide therapeutically for enhancing the 
repair of connective and support tissue, promoting the attachment, 
fixation and stabilization of tissue implants and enhancing wound 
healing. Diagnostic assays for identifying mutations in nucleic acid 
sequence encoding a polypeptide of the present invention and for 
detecting altered levels of the polypeptide of the present invention 
are also disclosed. 
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DEC JIATION AND POWER OF ATTORNEY 



As a below named inventor, I hereby declare that: 



My residence, post office address and citizenship are as stated below next to my name. 



I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names are listed below) 
of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

CONNECTIVE TISSUE GROWTH FACTOR-2 



the specification of which [X] is attached hereto or [ ] was filed on 
amended on (if applicable) 



as Application Serial No. 



and was 



I hereby state that I have reviewed and understand the contents of the above idendfied specification, including the claims, as amended by any amendment 
referred to above. 

I acknowledge the duty to disclose- information which is material to the patentability of this application in accordance with Title 37, Code of Federal 
Regulations, Section 1.56(a). 

I hereby claim foreign priority benefits under Title 35 , United States Code, Section 1 19 of any foreign application(s) for patent or inventor's certificate listed 
below and have also identified below any foreign ^plication for patent or inventor's certificate having a filing date before that of the application on which 
if priority is claimed. Prior Foreign Application(s): 

^ Priority Claimed 

3 Yes No 

fr: (Number) (Country) (Day/Month/Year Filed) 

I □ □ 

^ I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States application(s) listed below and, insofar as the subject matter 
' of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first paragraph of Title 35, United 
States Code, Section 1 12, 1 acknowledge the duty to disclose material information as defined in Title 37, Code of Federal Regulations, Section 1 .56(a) which 
3 occurred between the filing date of the prior application and the national or PCT international filing date of this application: 



PCT/US94/07736 



7/12/94 



(Application Serial No.) 



(Filing Date) 



Pending 



(Status - patented, pending, abandoned) 



(Application Serial No.) 



(Filing Date) 



(Status - patented, pending, abandoned) 



I hereby appoint the following attomey(s) and/or agent(s) to prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith: John N. Bain (Reg. No. 18,651); John G. Gilfillan, III (Reg. No. 22,746); Elliot M. Olstein (Reg. No. 24.025); Raymond J. Lillie (Reg. No. 
3 1 ,778); Charles J. Herron (Reg. No. 28,0 19); William Squire (Reg. No. 25,378); and Gregory Ferraro (Reg. No. 36, 134) of Carella, Bryne, Bain, GUlfillan 
Cecchi, Stewart & Olstein; Robert H. Benson (Reg. No. 30,446) of Human Genome Sciences, Inc., 9410 Key West Ave, Rockville, Maryland 20850-3338. 
Address correspondence and tel^hone calls to Gregory D. Ferraro c/o Carella, Byrne, Bain, Gilfillan, Cecchi, Stewart & Olstein, 6 Becker Farm Road, 
Roseland, NJ 07068 - (201) 994-1700. 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; 
and further that these statements were made mth the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 100 1 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 



Full name of first joint inventor: Haodong LI 



Inventor's signature: /)^/.tA>?-^^'-^ "^^^ Date: _ 

Residence: 11033 Rutledge Drive Citizenship: China 

Gaithersburg, Maryland 20878 
Post Office Address: Same as Above 



Full name of second joint inventor: Mark D. ADAMS 




clik. 



Inventor's signature: / ^ jr 4 ^ — ^-^^ Date: _ 

Residence: 15205 Duflef Drive' Citizenship: USA 

North Potomac, Maryland 20878 
Post Office Address: Same as Above 



CARELLA, BYRNE, BAIN, GILFILLAN, CECCHI, STEWART & OLSTEIN 
6 Becker Farm Road - Roseland, NJ 07068 - (201) 994-1700 



Docket No. 32S800.317 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of: Li et al. 

Divisional of Application Serial No: 08/459,101 Parent Filed: June % 1995 

Art Unit: Unassigned Examiner: Unassigned 

For: Connective Tissue Growth Factor-2 Agent Ref No: PF126P1D1 

POWER OF ATTORNEY BY ASSIGNEE OF ENTIRE INTEREST 
REVOCATION OF PRIOR POWER OF ATTORNEY 

Assistant Commissioner 
for Patents 
Washington, D.C. 20231 

Sir: 

Human Genome Sciences, Inc., organized and existing under the laws of the State of Delav^are, 
having its principal place of business at 9410 Key West Avenue, Rockville, Maryland 20850, is 
assignee of record of the entire right, title and interest for the above-identified apphcation by virtue of an 
assignment of parent application Serial No. 08/459 JOl filed in the U.S. Patent and Trademark Office 
on April 22, 1996 , at Reel 7937 , Frame 0374 and on April 14, 1996 . at Reel 7950 , Frame 
0718, 

The undersigned has reviewed all the documents in the chain of title of the patent application 
identified above and, to the best of undersigned's knowledge and belief, title is in the assignee identified 
above. 

The assignee hereby revokes all powers of attomey heretofore given in the above-captioned 
apphcation and appoints as its attomey(s) and agent(s): James H. Davis, (Reg. No. 40,582), A. Anders 
Brookes (Reg. No. 36,373), Kenley K. Hoover (Reg. No. 40,302), Michele M. Wales (Reg. No, 
43,975) and Joseph J. Kenny (Reg. No. 43,710), with full power of substitution, association, and 
revocation, to prosecute said application and to transact all business in the United States Patent and 
Trademark Office connected therewi±. 

The assignee also requests that the correspondence address be changed to the following: 

James H, Davis, Esq. 

Human Genome Sciences, Inc. 

9410 Key West Avenue 

Rockville, Maryland 20850 

Phone: 301-309-8504 Fax: 301-309-8439 



The undersigned, whose title is supplied below, is empowered to sign this document on behalf 
of the assignee. 

I hereby declare that all statements made herein of my own knowledge are true, and that all 
statements made on information and belief are believed to be true; and further, that these statements are 
made with the knowledge that willful false statements, and the like so made, are punishable by fine or 
imprisonment, or both, under Section 1001, Title 18 of the United States Code, and that such willful 
false statements may jeopardize the validity of the application captioned above or of any patent issuing 
thereon. 

On behalf of Human Genome Sciences, Inc. 



For: 



Human Genome Sciences. Inc 



By: 



Signature: 




Title: 



Senior Vice President and General Counsel 



Date: 




AAB/mbp 



38 

SEQUENCE LISTING 



1) GENERAL INFORMATION: 



(i) 



APPLICANT: LI, HAODONG 

ADAMS, MARK D 



(ii) TITLE OF INVENTION: CONNECTIVE TISSUE GROWTH FACTOR- 
(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: US 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0/ Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/459,101 

(B) FILING DATE: 02-JUN-1994 
{ C ) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: A. ANDERS BROOKES 

(B) REGISTRATION NUMBER: 3 6,373 

(C) REFERENCE/DOCKET NUMBER: PF126P1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 301-309-8504 

(B) TELEFAX: 301-309-8439 



2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1127 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 



(ix) FEATURE: 



(A) NAME/KEY: 

(B) LOCATION: 



CDS 

1, .1122 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



ATG AGC TCC CGA ATC GTC AGG GAG CTC GCC TTA GTC GTC ACC CTT CTC 
. Met Ser Ser Arg He Val Arg Glu Leu Ala Leu Val Val Thr Leu Leu 
15 10 15 

CAC TTG ACC AGG GTG GGG CTC TCC ACC TGC CCC GCT GAC TGC CAC TGC 
His Leu Thr Arg Val .Gly Leu Ser Thr Cys Pro Ala Asp Cys His Cys 

20 25 30 

CCC CTG GAG GCG CCC AAG TGC GCG CCG GGA GTC GGG CTG GTC CGG GAC 
Pro Leu Glu Ala Pro Lys Cys Ala Pro Gly Val Gly Leu Val Arg Asp 
35 40 45 

GGC TGC GGC TGT TGT AAG GTC TGC GCC AAG CAG CTC AAC GAG GAC TGC 
Gly Cys Gly Cys Cys Lys Val Cys Ala Lys Gin Leu Asn Glu Asp Cys 
50 55 60 

AGA AAA ACG CAG CCC TGC GAC CAC ACC AAG GGG CTG GAA TGC AAC TTC 
Arg Lys Thr Gin Pro Cys Asp His Thr Lys Gly Leu Glu Cys Asn Phe 

70 75 80 

GGC GCC AGC TCC ACC GCT CTG AAG GGG ATC TGC AGA GCT CAG TCA GAG 
Gly Ala Ser Ser Thr Ala Leu Lys Gly He Cys Arg Ala Gin Ser Glu 

85 90 95 

GGC AGA CCC TGT GAA TAT AAC TCC AGA ATC TAC CAA AAC GGG GAA AGT 
Gly Arg Pro Cys Glu Tyr Asn Ser Arg He Tyr Gin Asn Gly Glu Ser 

100 105 110 

TTC CAG CCC AAC TGT AAA CAT CAG TGC ACA TGT ATT GGA TGG CGC CGG 
Phe Gin Pro Asn Cys Lys His Gin Cys Thr Cys He Gly Trp Arg Arg 
115 120 125 

GGG GCT TGC ATT CCT CTG TGT CCC CAA GAA CTA TCT CTC CCC AAC TTG 
Gly Ala Cys He Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu 
130 135 140 

GGC TGT CCC AAC CCT CGG CTG GTC AAA GTT ACC GGG CAG TGC TGC GAG 
Gly Cys Pro Asn Pro Arg Leu Val Lys Val Thr Gly Gin Cys Cys Glu 
1^5 150 155 160 

GAG TGG GTC TGT GAC GAG GAT AGT ATC AAG GAC CCC ATG GAG GAC CAG 
Glu Trp Val Cys Asp Glu Asp Ser He Lys Asp Pro Met Glu Asp Gin 

165 170 175 

GAC GGC CTC CTT GGC AAG GGG CTG GGA TTC GAT GCC TCC GAG GTG GAG 
Asp Gly Leu Leu Gly Lys Gly Leu Gly Phe Asp Ala Ser Glu Val Glu 

180 185 190 

TTG ACG AGA AAC AAT GAA TTG ATT GCA GTT GGA AAA GGC AGC TCA CTG 
Leu Thr Arg Asn Asn Glu Leu He Ala Val Gly Lys Gly Ser Ser Leu 
195 200 205 
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AAG CGG CTC CCT GTT TTT GGA ATG GAG OCT CGC ATC CTA TAG AAC OCT 672 
Lys Arg Leu Pro Val Phe Gly Met Glu Pro Arg lie Leu Tyr Asn Pro 
210 215 220 

TTA CAA GGC GAG AAA TGT ATT GTT CAA ACA ACT TCA TGG TCC GAG TGC 720 
Leu Gin Gly Gin Lys Cys lie Val Gin Thr Thr Ser Trp Ser Gin Cys 
225 230 235 240 

TCA AAG ACC TGT GGA ACT GGT ATC TCC ACA CGA GTT ACC AAT GAG AAC 768 
Ser Lys Thr Cys Gly Thr Gly lie Ser Thr Arg Val Thr Asn Asp Asn 

245 250 255 

CCT GAG TGC CGC CTT GTG AAA GAA ACC CGG ATT TGT GAG GTG CGG CCT 816 
Pro Glu Cys Arg Leu Val Lys Glu Thr Arg lie Cys Glu Val Arg Pro 

260 265 270 

TGT GGA GAG CCA GTG TAG AGC AGC CTG AAA AAG GGC AAG AAA TGC AGC 864 
Cys Gly Gin Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser 
275 280 285 

AAG ACC AAG AAA TCC CGC GAA CCA GTG AGG TTT ACT TAG GCT GGA TGT 912 
Lys Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys 
290 295 300 

TTG AGT GTG AAG AAA TAG CGG CGC AAG TAG TGC GGT TCC TGC GTG GAC 960 
Leu Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp 
305 310 315 320 

GGC CGA TGC TGC ACG CGC GAG CTG ACC AGG ACT GTG AAG ATG CGG TTC 1008 
Gly Arg Cys Cys Thr Pro Gin Leu Thr Arg Thr Val Lys Met Arg Phe 

325 330 335 

CGC TGC GAA GAT GGG GAG ACA TTT TCC AAG AAC GTG ATG ATG ATC GAG 1056 
Pro Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met He Gin 

340 345 350 

TCC TCC AAA TGC AAC TAG AAC TGC CGG CAT GCC AAG AAG CAG GGT TTC 1104 
Ser Ser Lys Cys Asn Tyr Asn Cys Pro His Ala Lys Lys Gin Arg Phe 
355 360 365 

CCT TGT ACA GGC TGT TCC AATGA 1127 
Pro Ser Thr Gly Cys Ser 
370 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Ser Ser Arg 

1 

His Leu Thr Arg 

20 

Pro Leu Glu Ala 
35 

Gly Cys Gly Cys 
50 

Arg Lys Thr Gin 
65 

Gly Ala Ser Ser 



Gly Arg Pro Cys 

100 

Phe Gin Pro Asn 
115 

Gly Ala Cys lie 
130 

Gly Cys Pro Asn 
145 

Glu Trp Val Cys 



Asp Gly Leu Leu 

180 

Leu Thr Arg Asn 
195 

Lys Arg Leu Pro 
210 

Leu Gin Gly Gin 
225 

Ser Lys Thr Cys 



Pro Glu Cys Arg 

260 

Cys Gly Gin Pro 
275 



lie Val Arg Glu 
5 

Val Gly Leu Ser 



Pro Lys Cys Ala 

40 

Cys Lys Val Cys 
55 

Pro Cys Asp His 
70 

Thr Ala Leu Lys 
85 

Glu Tyr Asn Ser 



Cys Lys His Gin 

120 

Pro Leu Cys Pro 
135 

Pro Arg Leu Val 
150 

Asp Glu Asp Ser 
165 

Gly Lys Gly Leu 



Asn Glu Leu lie 

200 

Val Phe Gly Met 
215 

Lys Cys lie Val 
230 

Gly Thr Gly lie 
245 

Leu Val Lys Glu 



Val Tyr Ser Ser 

280 



Leu Ala Leu Val 
10 

Thr Cys Pro Ala 
25 

Pro Gly Val Gly 



Ala Lys Gin Leu 

60 

Thr Lys Gly Leu 
75 

Gly lie Cys Arg 
90 

Arg lie Tyr Gin 
105 

Cys Thr Cys lie 



Gin Glu Leu Ser 

140 

Lys Val Thr Gly 
155 

lie Lys Asp Pro 
170 

Gly Phe Asp Ala 
185 

Ala Val Gly Lys 



Glu Pro Arg lie 

220 

Gin Thr Thr Ser 
235 

Ser Thr Arg Val 
250 

Thr Arg lie Cys 
265 

Leu Lys Lys Gly 



Val Thr Leu Leu 

15 

Asp Cys His Cys 
30 

Leu Val Arg Asp 
45 

Asn Glu Asp Cys 



Glu Cys Asn Phe 

80 

Ala Gin Ser Glu 

95 

Asn Gly Glu Ser 
110 

Gly Trp Arg Arg 
125 

Leu Pro Asn Leu 



Gin Cys Cys Glu 

160 

Met Glu Asp Gin 
175 

Ser Glu Val Glu 
190 

Gly Ser Ser Leu 
205 

Leu Tyr Asn Pro 



Trp Ser Gin Cys 

240 

Thr Asn Asp Asn 
255 

Glu Val Arg Pro 
270 

Lys Lys Cys Ser 
285 
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Lys Thr Lys Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys 
290 295 300 

Leu Ser Val Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp 
305 310 315 320 

Gly Arg Cys Cys Thr Pro Gin Leu Thr Arg Thr Val Lys Met Arg Phe 

325 330 335 

Pro Cys Glu Asp Gly Glu Thr Phe Ser Lys Asn Val Met Met lie Gin 

340 345 350 

Ser Ser Lys Cys Asn Tyr Asn Cys Pro His Ala Lys Lys Gin Arg Phe 
355 360 365 

Pro Ser Thr Gly Cys Ser 
370 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DEI 

Met Ser Ser Ser 
1 

His Leu Thr Arg 

20 

Pro Leu Glu Ala 
35 

Gly Cys Gly Cys 
50 

Ser Lys Thr Gin 
65 

Gly Ala Ser Ser 



Gly Arg Pro Cys 

100 



CRIPTION: SEQ T. 

Thr Phe* Arg Thr 
5 

Leu Ala Leu Ser 



Pro Lys Cys Ala 

40 

Cys Lys Val Cys 
55 

Pro Cys Asp His 
70 

Thr Ala Leu Lys 
85 

Glu Tyr Asn Ser 



N0:3 : 

Leu Ala Val Ala 
10 

Thr Cys Pro Ala 
25 

Pro Gly Val Gly 



Ala Lys Gin Leu 

60 

Thr Lys Gly Leu 
75 

Gly lie Cys Arg 
90 

Arg lie Tyr Gin 
105 



Val Thr Leu Leu 
15 

Ala Cys His Cys 
30 

Leu Val Arg Asp 
45 

Asn Glu Asp Cys 



Glu Cys Asn Phe 

80 

Ala Gin Ser Glu 
95 

Asn Gly Glu Ser 
110 
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Phe Gin Pro Asn Cys Lys His Gin Cys Thr Cys lie Asp Gly Ala Val 
115 120 125 

Gly Cys lie Pro Leu Cys Pro Gin Glu Leu Ser Leu Pro Asn Leu Gly 
130 135 140 

Cys Pro Asn Pro Arg Leu Val Lys Val Ser Gly Gin Cys Cys Glu Glu 
145 150 155 160 

Trp Val Cys Asp Glu Asp Ser He Lys Asp Ser Leu Asp Asp Gin Asp 

165 170 175 

Asp Leu Leu Gly Leu Asp Ala Ser Glu Val Glu Leu Thr Arg Asn Asn 

180 185 190 

Glu Leu He Ala lie Gly Lys Gly Ser Ser Leu Lys Arg Leu Pro Val 
195 200 205 

Phe Gly Thr Glu Pro Arg Val Leu Phe Asn Pro Leu His Ala His Gly 
210 215 220 

Gin Lys Cys lie Val Gin Thr Thr Ser Trp Ser Gin Cys Ser Lys Ser 
225 230 235 240 

Cys Gly Thr Gly He Ser Thr Arg Val Thr Asn Asp Asn Pro Glu Cys 

245 250 255 

Arg Leu Val Lys Glu Thr Arg He Cys Glu Val Arg Pro Cys Gly Gin 

260 265 270 

Pro Val Tyr Ser Ser Leu Lys Lys Gly Lys Lys Cys Ser Lys Thr Lys 
275 280 285 

Lys Ser Pro Glu Pro Val Arg Phe Thr Tyr Ala Gly Cys Ser Ser Val 
290 295 300 

Lys Lys Tyr Arg Pro Lys Tyr Cys Gly Ser Cys Val Asp Gly Arg Cys 
305 310 315 320 

Cys Thr Pro Leu Gin Thr Arg Thr Val Lys Met Arg Phe Arg Cys Glu 

325 330 335 

Asp Gly Glu Met Phe Ser Lys Asn Val Met Met He Gin Ser Cys Lys 

340 345 350 

Cys Asn Tyr Asn Cys Pro His Pro Asn Glu Ala Ser Phe Arg Leu Tyr 
355 360 365 

Ser Leu Phe Asn 
370 

(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 



'4 



y 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Ala Ser Val Ala Gly Pro lie Ser Leu Ala Leu Val Leu Leu 
15 10 15 

Ala Leu Cys Thr Arg Pro Ala Thr Gly Gin Asp Cys Ser Ala Gin Cys 

20 25 30 

Gin Cys Ala Ala Glu Ala Ala Pro His Cys Pro Ala Gly Val Ser Leu 
35 40 45 

Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu Gly 
50 55 60 

Glu Leu Cys Thr Glu Arg Asp Pro^ Cys Asp Pro His Lys Gly Leu Phe 
65 70 * 75 80 

Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys lie Gly Val Cys Thr Ala 

85 90 95 

Lys Asp Gly Ala Pro Cys Val Phe Gly Gly Ser Val Tyr Arg Ser Gly 

100 105 110 

Glu Ser Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp Gly 
115 120 125 

Ala Val Gly Cys Val Pro Leu Cys Ser Met Asp Val Arg Leu Pro Ser 
130 135 140 

Pro Asp Cys Pro Phe Pro Arg Arg Val Lys Leu Pro Gly Lys Cys Cys 
145 150 155 160 

Lys Glu Trp Val Cys Asp Glu Pro Lys Asp Arg Thr Ala Val Gly Pro 

165 170 175 

Ala Leu Ala Ala Tyr Arg Leu Glu Asp Thr Phe Gly Pro Asp Pro Thr 

180 185 190 

Met Met Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala Cys 
195 200 205 

Ser Lys Thr Cys Cys Met Gly lie Ser Thr Arg Val Thr Asn Asp Asn 
210 215 220 



Thr Phe Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg Pro 
225 230 235 240 



Cys Glu Ala Asp Leu Glu Glu Asn lie Lys Lys Gly Lys Lys Cys lie 

245 250 255 



Arg Thr Pro Lys lie Ala Lys Pro Val Lys Phe Glu Leu Ser Gly Cys 

260 265 270 

Thr Ser Val Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp 
275 280 285 

Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe 
290 295 300 

Lys Cys Pro Asp Gly Glu lie Met Lys Lys Asn Met Met Phe lie Lys 
305 310 315 320 

Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp lie Phe Glu 

325 330 335 

Ser Leu Tyr Tyr Arg Lys Met Tyr Gly 

340 345 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCGGGATCC TGCGCGACAC AATGAGCT 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGCGGGTACC AGGTAGCATT TAGTCCCTAA 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AAAGGATCCA CAATGAGCTC CCGAATC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGCTCTAGAT TAAGCGTAGT CTGGGACGTC GTATGGGTAT TGGAACAGCC TGTAGAAG 



58 



